a31ba94b933be2190a4c90b611056a1897730cfe
front devel test 4 base
- challenge
- "He Said She Said" classification challenge (2nd edition)
- submitter
- devel
- submitted
- 2023-11-03 11:50:59.429639 UTC
- file basename
- out
dev-0 / 2e70a0bcc6bb7c4401aeea2e7c72e685ef40ee9c
| Metric | Score |
|---|---|
| Likelihood | 0.00000 |
| Accuracy | 0.52509 |
| Likelihood | Accuracy | |
|---|---|---|
| +H | 0.00000 | 0.48500 |
| +C | 1.00000 | 1.00000 |
| -C | 0.00000 | 0.52509 |
worst items
note: the gold standard is taken from the submission itself, not from the challenge data!| # | input | expected output | actual output | dev-1 Likelihood +C |
|---|---|---|---|---|
| 1 | Cierpiałem na straszne lagi – kilkanaście sekund lub dłużej czarnego ekranu przy próbie przełączenia się / uruchomienia prawie każdej aplika… | 1 | 1 | 1.00000 |